Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 2550 |
| Missing cells | 1176 |
| Missing cells (%) | 2.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 378.6 KiB |
| Average record size in memory | 152.1 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 8 |
LABEL2013 has constant value "Urban" | Constant |
LABEL2014 has constant value "Urban" | Constant |
LABEL2015 has constant value "Urban" | Constant |
LABEL2016 has constant value "Urban" | Constant |
LABEL2017 has constant value "Urban" | Constant |
LABEL2018 has constant value "Urban" | Constant |
LABEL2019 has constant value "Urban" | Constant |
LABEL2020 has constant value "Urban" | Constant |
df_index is highly correlated with LAT and 4 other fields | High correlation |
LON is highly correlated with df_index and 8 other fields | High correlation |
2013 is highly correlated with df_index and 8 other fields | High correlation |
2014 is highly correlated with df_index and 9 other fields | High correlation |
2015 is highly correlated with df_index and 8 other fields | High correlation |
2016 is highly correlated with LON and 7 other fields | High correlation |
2017 is highly correlated with LON and 7 other fields | High correlation |
2018 is highly correlated with LON and 7 other fields | High correlation |
2019 is highly correlated with LON and 7 other fields | High correlation |
2020 is highly correlated with 2013 and 6 other fields | High correlation |
LABEL2020 is highly correlated with LABEL2017 and 6 other fields | High correlation |
LABEL2017 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2016 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2018 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2019 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2014 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2013 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LABEL2015 is highly correlated with LABEL2020 and 6 other fields | High correlation |
LAT is highly correlated with df_index and 2 other fields | High correlation |
LABEL2013 has 230 (9.0%) missing values | Missing |
LABEL2014 has 200 (7.8%) missing values | Missing |
LABEL2015 has 194 (7.6%) missing values | Missing |
LABEL2016 has 190 (7.5%) missing values | Missing |
LABEL2017 has 179 (7.0%) missing values | Missing |
LABEL2018 has 118 (4.6%) missing values | Missing |
LABEL2019 has 65 (2.5%) missing values | Missing |
df_index has unique values | Unique |
Reproduction
| Analysis started | 2022-09-22 15:21:57.776620 |
|---|---|
| Analysis finished | 2022-09-22 15:22:14.502286 |
| Duration | 16.73 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 2550 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12901.11804 |
| Minimum | 107 |
|---|---|
| Maximum | 25800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 107 |
|---|---|
| 5-th percentile | 6984.45 |
| Q1 | 10530.25 |
| median | 12940.5 |
| Q3 | 15419.75 |
| 95-th percentile | 18046.1 |
| Maximum | 25800 |
| Range | 25693 |
| Interquartile range (IQR) | 4889.5 |
Descriptive statistics
| Standard deviation | 3937.916179 |
|---|---|
| Coefficient of variation (CV) | 0.3052383651 |
| Kurtosis | 1.662250451 |
| Mean | 12901.11804 |
| Median Absolute Deviation (MAD) | 2462 |
| Skewness | -0.1172750626 |
| Sum | 32897851 |
| Variance | 15507183.83 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 107 | 1 | < 0.1% |
| 14550 | 1 | < 0.1% |
| 14543 | 1 | < 0.1% |
| 14544 | 1 | < 0.1% |
| 14545 | 1 | < 0.1% |
| 14546 | 1 | < 0.1% |
| 14547 | 1 | < 0.1% |
| 14548 | 1 | < 0.1% |
| 14549 | 1 | < 0.1% |
| 14551 | 1 | < 0.1% |
| Other values (2540) | 2540 |
| Value | Count | Frequency (%) |
| 107 | 1 | |
| 108 | 1 | |
| 143 | 1 | |
| 246 | 1 | |
| 247 | 1 | |
| 347 | 1 | |
| 348 | 1 | |
| 425 | 1 | |
| 426 | 1 | |
| 427 | 1 |
| Value | Count | Frequency (%) |
| 25800 | 1 | |
| 25774 | 1 | |
| 25473 | 1 | |
| 25455 | 1 | |
| 25453 | 1 | |
| 25399 | 1 | |
| 25382 | 1 | |
| 25381 | 1 | |
| 25380 | 1 | |
| 25379 | 1 |
| Distinct | 100 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.43357255 |
| Minimum | 17.0275 |
|---|---|
| Maximum | 17.7475 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 17.0275 |
|---|---|
| 5-th percentile | 17.3075 |
| Q1 | 17.3775 |
| median | 17.4425 |
| Q3 | 17.4975 |
| 95-th percentile | 17.5425 |
| Maximum | 17.7475 |
| Range | 0.72 |
| Interquartile range (IQR) | 0.12 |
Descriptive statistics
| Standard deviation | 0.08394416492 |
|---|---|
| Coefficient of variation (CV) | 0.004815086792 |
| Kurtosis | 1.99202693 |
| Mean | 17.43357255 |
| Median Absolute Deviation (MAD) | 0.06 |
| Skewness | -0.6344688667 |
| Sum | 44455.61 |
| Variance | 0.007046622824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.5075 | 65 | 2.5% |
| 17.4975 | 62 | 2.4% |
| 17.5125 | 62 | 2.4% |
| 17.5025 | 61 | 2.4% |
| 17.4525 | 61 | 2.4% |
| 17.4925 | 61 | 2.4% |
| 17.4825 | 60 | 2.4% |
| 17.4475 | 60 | 2.4% |
| 17.4575 | 59 | 2.3% |
| 17.4625 | 59 | 2.3% |
| Other values (90) | 1940 |
| Value | Count | Frequency (%) |
| 17.0275 | 2 | |
| 17.0625 | 2 | |
| 17.0675 | 4 | |
| 17.0725 | 4 | |
| 17.0775 | 4 | |
| 17.0825 | 1 | < 0.1% |
| 17.0875 | 1 | < 0.1% |
| 17.0975 | 2 | |
| 17.1275 | 2 | |
| 17.1525 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 17.7475 | 1 | < 0.1% |
| 17.7425 | 2 | 0.1% |
| 17.6875 | 1 | < 0.1% |
| 17.6425 | 3 | |
| 17.6375 | 5 | |
| 17.6325 | 6 | |
| 17.6275 | 4 | |
| 17.6225 | 5 | |
| 17.6175 | 6 | |
| 17.6125 | 5 |
| Distinct | 139 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 78.465233 |
| Minimum | 78.0475 |
|---|---|
| Maximum | 78.92751 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 78.0475 |
|---|---|
| 5-th percentile | 78.28751 |
| Q1 | 78.3975 |
| median | 78.4675 |
| Q3 | 78.53751 |
| 95-th percentile | 78.6125 |
| Maximum | 78.92751 |
| Range | 0.88001 |
| Interquartile range (IQR) | 0.14001 |
Descriptive statistics
| Standard deviation | 0.1193077387 |
|---|---|
| Coefficient of variation (CV) | 0.001520517229 |
| Kurtosis | 2.605373412 |
| Mean | 78.465233 |
| Median Absolute Deviation (MAD) | 0.07001 |
| Skewness | -0.008685703397 |
| Sum | 200086.3441 |
| Variance | 0.01423433651 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 78.42751 | 53 | 2.1% |
| 78.4325 | 53 | 2.1% |
| 78.4875 | 52 | 2.0% |
| 78.48251 | 50 | 2.0% |
| 78.4625 | 50 | 2.0% |
| 78.4225 | 49 | 1.9% |
| 78.4775 | 49 | 1.9% |
| 78.4175 | 48 | 1.9% |
| 78.4375 | 48 | 1.9% |
| 78.5175 | 48 | 1.9% |
| Other values (129) | 2050 |
| Value | Count | Frequency (%) |
| 78.0475 | 2 | 0.1% |
| 78.05251 | 1 | < 0.1% |
| 78.0625 | 2 | 0.1% |
| 78.0675 | 2 | 0.1% |
| 78.0725 | 6 | |
| 78.0775 | 11 | |
| 78.0825 | 6 | |
| 78.0875 | 5 | |
| 78.1125 | 1 | < 0.1% |
| 78.1175 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 78.92751 | 1 | < 0.1% |
| 78.9225 | 1 | < 0.1% |
| 78.9025 | 1 | < 0.1% |
| 78.8975 | 3 | 0.1% |
| 78.8925 | 8 | |
| 78.8875 | 7 | |
| 78.8825 | 7 | |
| 78.8775 | 5 | |
| 78.87251 | 3 | 0.1% |
| 78.8375 | 1 | < 0.1% |
| Distinct | 2414 |
|---|---|
| Distinct (%) | 94.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3287112235 |
| Minimum | 0.16032 |
|---|---|
| Maximum | 0.58204 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.16032 |
|---|---|
| 5-th percentile | 0.2175865 |
| Q1 | 0.27691 |
| median | 0.323695 |
| Q3 | 0.3755075 |
| 95-th percentile | 0.4574415 |
| Maximum | 0.58204 |
| Range | 0.42172 |
| Interquartile range (IQR) | 0.0985975 |
Descriptive statistics
| Standard deviation | 0.07074401917 |
|---|---|
| Coefficient of variation (CV) | 0.2152163179 |
| Kurtosis | -0.2585913859 |
| Mean | 0.3287112235 |
| Median Absolute Deviation (MAD) | 0.049015 |
| Skewness | 0.2833181672 |
| Sum | 838.21362 |
| Variance | 0.005004716248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.28616 | 4 | 0.2% |
| 0.27346 | 3 | 0.1% |
| 0.41829 | 3 | 0.1% |
| 0.33072 | 3 | 0.1% |
| 0.29692 | 3 | 0.1% |
| 0.22639 | 2 | 0.1% |
| 0.29234 | 2 | 0.1% |
| 0.21337 | 2 | 0.1% |
| 0.39562 | 2 | 0.1% |
| 0.32785 | 2 | 0.1% |
| Other values (2404) | 2524 |
| Value | Count | Frequency (%) |
| 0.16032 | 1 | |
| 0.16302 | 1 | |
| 0.16714 | 1 | |
| 0.1673 | 1 | |
| 0.16744 | 1 | |
| 0.17128 | 1 | |
| 0.17191 | 1 | |
| 0.17381 | 1 | |
| 0.17499 | 1 | |
| 0.17686 | 1 |
| Value | Count | Frequency (%) |
| 0.58204 | 1 | |
| 0.56936 | 1 | |
| 0.54841 | 1 | |
| 0.53817 | 1 | |
| 0.53354 | 1 | |
| 0.52991 | 1 | |
| 0.52731 | 1 | |
| 0.52646 | 1 | |
| 0.52012 | 1 | |
| 0.51869 | 1 |
| Distinct | 2431 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.323322102 |
| Minimum | 0.16165 |
|---|---|
| Maximum | 0.58526 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.16165 |
|---|---|
| 5-th percentile | 0.2175325 |
| Q1 | 0.272355 |
| median | 0.318215 |
| Q3 | 0.3669125 |
| 95-th percentile | 0.448162 |
| Maximum | 0.58526 |
| Range | 0.42361 |
| Interquartile range (IQR) | 0.0945575 |
Descriptive statistics
| Standard deviation | 0.06898725546 |
|---|---|
| Coefficient of variation (CV) | 0.2133700574 |
| Kurtosis | -0.07208338358 |
| Mean | 0.323322102 |
| Median Absolute Deviation (MAD) | 0.047255 |
| Skewness | 0.395791906 |
| Sum | 824.47136 |
| Variance | 0.004759241416 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.38254 | 4 | 0.2% |
| 0.27308 | 3 | 0.1% |
| 0.2746 | 3 | 0.1% |
| 0.27109 | 3 | 0.1% |
| 0.33969 | 3 | 0.1% |
| 0.31084 | 2 | 0.1% |
| 0.29409 | 2 | 0.1% |
| 0.34752 | 2 | 0.1% |
| 0.31754 | 2 | 0.1% |
| 0.33371 | 2 | 0.1% |
| Other values (2421) | 2524 |
| Value | Count | Frequency (%) |
| 0.16165 | 1 | |
| 0.16378 | 1 | |
| 0.16593 | 1 | |
| 0.16685 | 1 | |
| 0.16855 | 1 | |
| 0.16961 | 1 | |
| 0.17039 | 1 | |
| 0.17096 | 1 | |
| 0.17518 | 1 | |
| 0.17618 | 1 |
| Value | Count | Frequency (%) |
| 0.58526 | 1 | |
| 0.58046 | 1 | |
| 0.54243 | 1 | |
| 0.52757 | 1 | |
| 0.5244 | 1 | |
| 0.52159 | 1 | |
| 0.52057 | 1 | |
| 0.51903 | 1 | |
| 0.51898 | 1 | |
| 0.51754 | 1 |
| Distinct | 2408 |
|---|---|
| Distinct (%) | 94.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3209652549 |
| Minimum | 0.16686 |
|---|---|
| Maximum | 0.5692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.16686 |
|---|---|
| 5-th percentile | 0.2187695 |
| Q1 | 0.2730075 |
| median | 0.316755 |
| Q3 | 0.3651425 |
| 95-th percentile | 0.439987 |
| Maximum | 0.5692 |
| Range | 0.40234 |
| Interquartile range (IQR) | 0.092135 |
Descriptive statistics
| Standard deviation | 0.06598958445 |
|---|---|
| Coefficient of variation (CV) | 0.2055972833 |
| Kurtosis | -0.173715908 |
| Mean | 0.3209652549 |
| Median Absolute Deviation (MAD) | 0.04584 |
| Skewness | 0.3201133772 |
| Sum | 818.4614 |
| Variance | 0.004354625255 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3164 | 3 | 0.1% |
| 0.40373 | 3 | 0.1% |
| 0.27054 | 3 | 0.1% |
| 0.36793 | 3 | 0.1% |
| 0.27798 | 3 | 0.1% |
| 0.35868 | 3 | 0.1% |
| 0.30401 | 3 | 0.1% |
| 0.40993 | 3 | 0.1% |
| 0.33349 | 3 | 0.1% |
| 0.36318 | 2 | 0.1% |
| Other values (2398) | 2521 |
| Value | Count | Frequency (%) |
| 0.16686 | 1 | |
| 0.16788 | 1 | |
| 0.16921 | 1 | |
| 0.17173 | 1 | |
| 0.17237 | 1 | |
| 0.17397 | 1 | |
| 0.17455 | 1 | |
| 0.17611 | 1 | |
| 0.17692 | 1 | |
| 0.17768 | 1 |
| Value | Count | Frequency (%) |
| 0.5692 | 1 | |
| 0.55331 | 1 | |
| 0.53364 | 1 | |
| 0.52682 | 1 | |
| 0.52634 | 1 | |
| 0.51979 | 1 | |
| 0.51608 | 1 | |
| 0.50335 | 1 | |
| 0.50183 | 1 | |
| 0.50118 | 1 |
| Distinct | 2415 |
|---|---|
| Distinct (%) | 94.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.306381 |
| Minimum | 0.15874 |
|---|---|
| Maximum | 0.54005 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.15874 |
|---|---|
| 5-th percentile | 0.209906 |
| Q1 | 0.2605125 |
| median | 0.300795 |
| Q3 | 0.3490625 |
| 95-th percentile | 0.421138 |
| Maximum | 0.54005 |
| Range | 0.38131 |
| Interquartile range (IQR) | 0.08855 |
Descriptive statistics
| Standard deviation | 0.06319287678 |
|---|---|
| Coefficient of variation (CV) | 0.2062558605 |
| Kurtosis | -0.2428108851 |
| Mean | 0.306381 |
| Median Absolute Deviation (MAD) | 0.04434 |
| Skewness | 0.3544407113 |
| Sum | 781.27155 |
| Variance | 0.003993339676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.34755 | 3 | 0.1% |
| 0.34984 | 3 | 0.1% |
| 0.35404 | 3 | 0.1% |
| 0.29107 | 3 | 0.1% |
| 0.27848 | 3 | 0.1% |
| 0.28084 | 3 | 0.1% |
| 0.32901 | 3 | 0.1% |
| 0.2809 | 2 | 0.1% |
| 0.34212 | 2 | 0.1% |
| 0.38992 | 2 | 0.1% |
| Other values (2405) | 2523 |
| Value | Count | Frequency (%) |
| 0.15874 | 1 | |
| 0.16007 | 1 | |
| 0.16173 | 1 | |
| 0.16235 | 1 | |
| 0.16336 | 1 | |
| 0.16437 | 1 | |
| 0.16604 | 1 | |
| 0.16619 | 1 | |
| 0.16786 | 1 | |
| 0.17231 | 1 |
| Value | Count | Frequency (%) |
| 0.54005 | 1 | |
| 0.53122 | 1 | |
| 0.52233 | 1 | |
| 0.50076 | 1 | |
| 0.50047 | 1 | |
| 0.49507 | 1 | |
| 0.48752 | 1 | |
| 0.48245 | 1 | |
| 0.4806 | 1 | |
| 0.47457 | 1 |
| Distinct | 2429 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3234497882 |
| Minimum | 0.16212 |
|---|---|
| Maximum | 0.58771 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.16212 |
|---|---|
| 5-th percentile | 0.2167565 |
| Q1 | 0.2707375 |
| median | 0.317835 |
| Q3 | 0.37045 |
| 95-th percentile | 0.4543505 |
| Maximum | 0.58771 |
| Range | 0.42559 |
| Interquartile range (IQR) | 0.0997125 |
Descriptive statistics
| Standard deviation | 0.07130246953 |
|---|---|
| Coefficient of variation (CV) | 0.2204437045 |
| Kurtosis | -0.09581256326 |
| Mean | 0.3234497882 |
| Median Absolute Deviation (MAD) | 0.049665 |
| Skewness | 0.4227081944 |
| Sum | 824.79696 |
| Variance | 0.005084042161 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.29951 | 2 | 0.1% |
| 0.26605 | 2 | 0.1% |
| 0.24887 | 2 | 0.1% |
| 0.36301 | 2 | 0.1% |
| 0.39209 | 2 | 0.1% |
| 0.25568 | 2 | 0.1% |
| 0.35837 | 2 | 0.1% |
| 0.28268 | 2 | 0.1% |
| 0.347 | 2 | 0.1% |
| 0.26277 | 2 | 0.1% |
| Other values (2419) | 2530 |
| Value | Count | Frequency (%) |
| 0.16212 | 1 | |
| 0.16384 | 1 | |
| 0.1645 | 1 | |
| 0.16538 | 1 | |
| 0.16564 | 1 | |
| 0.16904 | 1 | |
| 0.1711 | 1 | |
| 0.1725 | 1 | |
| 0.17286 | 1 | |
| 0.17436 | 1 |
| Value | Count | Frequency (%) |
| 0.58771 | 1 | |
| 0.56642 | 1 | |
| 0.56352 | 1 | |
| 0.56123 | 1 | |
| 0.55649 | 1 | |
| 0.54909 | 1 | |
| 0.53513 | 1 | |
| 0.53177 | 1 | |
| 0.52764 | 2 | |
| 0.52692 | 1 |
| Distinct | 2422 |
|---|---|
| Distinct (%) | 95.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3117158863 |
| Minimum | 0.161 |
|---|---|
| Maximum | 0.56912 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.161 |
|---|---|
| 5-th percentile | 0.213819 |
| Q1 | 0.263945 |
| median | 0.30481 |
| Q3 | 0.354005 |
| 95-th percentile | 0.4345975 |
| Maximum | 0.56912 |
| Range | 0.40812 |
| Interquartile range (IQR) | 0.09006 |
Descriptive statistics
| Standard deviation | 0.06563081211 |
|---|---|
| Coefficient of variation (CV) | 0.2105468954 |
| Kurtosis | -0.009656848504 |
| Mean | 0.3117158863 |
| Median Absolute Deviation (MAD) | 0.04426 |
| Skewness | 0.4847883394 |
| Sum | 794.87551 |
| Variance | 0.004307403498 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.33194 | 3 | 0.1% |
| 0.30637 | 3 | 0.1% |
| 0.25603 | 3 | 0.1% |
| 0.26413 | 2 | 0.1% |
| 0.29749 | 2 | 0.1% |
| 0.28603 | 2 | 0.1% |
| 0.31095 | 2 | 0.1% |
| 0.26284 | 2 | 0.1% |
| 0.29327 | 2 | 0.1% |
| 0.29245 | 2 | 0.1% |
| Other values (2412) | 2527 |
| Value | Count | Frequency (%) |
| 0.161 | 1 | |
| 0.16128 | 1 | |
| 0.16357 | 1 | |
| 0.16476 | 1 | |
| 0.16607 | 1 | |
| 0.16652 | 1 | |
| 0.16965 | 1 | |
| 0.17032 | 1 | |
| 0.17073 | 1 | |
| 0.17109 | 1 |
| Value | Count | Frequency (%) |
| 0.56912 | 1 | |
| 0.55591 | 1 | |
| 0.54069 | 1 | |
| 0.5166 | 1 | |
| 0.51463 | 1 | |
| 0.50617 | 1 | |
| 0.50608 | 1 | |
| 0.50382 | 1 | |
| 0.50322 | 1 | |
| 0.50268 | 1 |
| Distinct | 2404 |
|---|---|
| Distinct (%) | 94.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3095043961 |
| Minimum | 0.16108 |
|---|---|
| Maximum | 0.55003 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.16108 |
|---|---|
| 5-th percentile | 0.2153725 |
| Q1 | 0.2632075 |
| median | 0.301755 |
| Q3 | 0.3505525 |
| 95-th percentile | 0.43216 |
| Maximum | 0.55003 |
| Range | 0.38895 |
| Interquartile range (IQR) | 0.087345 |
Descriptive statistics
| Standard deviation | 0.06509410737 |
|---|---|
| Coefficient of variation (CV) | 0.2103172304 |
| Kurtosis | 0.1722441244 |
| Mean | 0.3095043961 |
| Median Absolute Deviation (MAD) | 0.042895 |
| Skewness | 0.5751257865 |
| Sum | 789.23621 |
| Variance | 0.004237242814 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.29154 | 3 | 0.1% |
| 0.27034 | 3 | 0.1% |
| 0.32341 | 3 | 0.1% |
| 0.34081 | 3 | 0.1% |
| 0.32226 | 3 | 0.1% |
| 0.27142 | 3 | 0.1% |
| 0.26464 | 3 | 0.1% |
| 0.32272 | 3 | 0.1% |
| 0.33507 | 3 | 0.1% |
| 0.40432 | 3 | 0.1% |
| Other values (2394) | 2520 |
| Value | Count | Frequency (%) |
| 0.16108 | 1 | |
| 0.16312 | 1 | |
| 0.16329 | 1 | |
| 0.16448 | 1 | |
| 0.16602 | 1 | |
| 0.16701 | 1 | |
| 0.16941 | 1 | |
| 0.16964 | 1 | |
| 0.16981 | 1 | |
| 0.16988 | 1 |
| Value | Count | Frequency (%) |
| 0.55003 | 1 | |
| 0.53706 | 1 | |
| 0.53432 | 1 | |
| 0.53238 | 1 | |
| 0.52987 | 1 | |
| 0.5215 | 1 | |
| 0.51546 | 1 | |
| 0.51525 | 1 | |
| 0.51509 | 1 | |
| 0.51507 | 1 |
| Distinct | 2401 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3386187373 |
| Minimum | 0.17588 |
|---|---|
| Maximum | 0.58955 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.0 KiB |
Quantile statistics
| Minimum | 0.17588 |
|---|---|
| 5-th percentile | 0.2370135 |
| Q1 | 0.28543 |
| median | 0.32747 |
| Q3 | 0.3826475 |
| 95-th percentile | 0.475871 |
| Maximum | 0.58955 |
| Range | 0.41367 |
| Interquartile range (IQR) | 0.0972175 |
Descriptive statistics
| Standard deviation | 0.07226835371 |
|---|---|
| Coefficient of variation (CV) | 0.2134210124 |
| Kurtosis | -0.01108442795 |
| Mean | 0.3386187373 |
| Median Absolute Deviation (MAD) | 0.04702 |
| Skewness | 0.5884365333 |
| Sum | 863.47778 |
| Variance | 0.005222714948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.30222 | 3 | 0.1% |
| 0.28577 | 3 | 0.1% |
| 0.30689 | 3 | 0.1% |
| 0.28546 | 3 | 0.1% |
| 0.30962 | 3 | 0.1% |
| 0.30923 | 3 | 0.1% |
| 0.31877 | 2 | 0.1% |
| 0.36627 | 2 | 0.1% |
| 0.45222 | 2 | 0.1% |
| 0.36681 | 2 | 0.1% |
| Other values (2391) | 2524 |
| Value | Count | Frequency (%) |
| 0.17588 | 1 | |
| 0.17939 | 1 | |
| 0.17948 | 1 | |
| 0.18124 | 1 | |
| 0.18263 | 1 | |
| 0.1857 | 1 | |
| 0.1861 | 1 | |
| 0.18618 | 1 | |
| 0.18716 | 1 | |
| 0.18736 | 1 |
| Value | Count | Frequency (%) |
| 0.58955 | 1 | |
| 0.57205 | 1 | |
| 0.5684 | 1 | |
| 0.56286 | 1 | |
| 0.56236 | 1 | |
| 0.56224 | 1 | |
| 0.56012 | 1 | |
| 0.55977 | 1 | |
| 0.55794 | 1 | |
| 0.55218 | 1 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 230 |
| Missing (%) | 9.0% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11600 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2320 | |
| (Missing) | 230 | 9.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2320 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2320 | |
| r | 2320 | |
| b | 2320 | |
| a | 2320 | |
| n | 2320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9280 | |
| Uppercase Letter | 2320 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2320 | |
| b | 2320 | |
| a | 2320 | |
| n | 2320 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11600 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2320 | |
| r | 2320 | |
| b | 2320 | |
| a | 2320 | |
| n | 2320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11600 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2320 | |
| r | 2320 | |
| b | 2320 | |
| a | 2320 | |
| n | 2320 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 200 |
| Missing (%) | 7.8% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11750 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2350 | |
| (Missing) | 200 | 7.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2350 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2350 | |
| r | 2350 | |
| b | 2350 | |
| a | 2350 | |
| n | 2350 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9400 | |
| Uppercase Letter | 2350 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2350 | |
| b | 2350 | |
| a | 2350 | |
| n | 2350 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2350 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11750 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2350 | |
| r | 2350 | |
| b | 2350 | |
| a | 2350 | |
| n | 2350 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2350 | |
| r | 2350 | |
| b | 2350 | |
| a | 2350 | |
| n | 2350 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 194 |
| Missing (%) | 7.6% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11780 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2356 | |
| (Missing) | 194 | 7.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2356 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2356 | |
| r | 2356 | |
| b | 2356 | |
| a | 2356 | |
| n | 2356 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9424 | |
| Uppercase Letter | 2356 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2356 | |
| b | 2356 | |
| a | 2356 | |
| n | 2356 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2356 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11780 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2356 | |
| r | 2356 | |
| b | 2356 | |
| a | 2356 | |
| n | 2356 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11780 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2356 | |
| r | 2356 | |
| b | 2356 | |
| a | 2356 | |
| n | 2356 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 190 |
| Missing (%) | 7.5% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11800 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2360 | |
| (Missing) | 190 | 7.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2360 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2360 | |
| r | 2360 | |
| b | 2360 | |
| a | 2360 | |
| n | 2360 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9440 | |
| Uppercase Letter | 2360 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2360 | |
| b | 2360 | |
| a | 2360 | |
| n | 2360 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2360 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11800 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2360 | |
| r | 2360 | |
| b | 2360 | |
| a | 2360 | |
| n | 2360 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11800 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2360 | |
| r | 2360 | |
| b | 2360 | |
| a | 2360 | |
| n | 2360 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 179 |
| Missing (%) | 7.0% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11855 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2371 | |
| (Missing) | 179 | 7.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2371 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2371 | |
| r | 2371 | |
| b | 2371 | |
| a | 2371 | |
| n | 2371 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9484 | |
| Uppercase Letter | 2371 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2371 | |
| b | 2371 | |
| a | 2371 | |
| n | 2371 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2371 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11855 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2371 | |
| r | 2371 | |
| b | 2371 | |
| a | 2371 | |
| n | 2371 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11855 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2371 | |
| r | 2371 | |
| b | 2371 | |
| a | 2371 | |
| n | 2371 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 118 |
| Missing (%) | 4.6% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12160 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2432 | |
| (Missing) | 118 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2432 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2432 | |
| r | 2432 | |
| b | 2432 | |
| a | 2432 | |
| n | 2432 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9728 | |
| Uppercase Letter | 2432 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2432 | |
| b | 2432 | |
| a | 2432 | |
| n | 2432 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2432 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12160 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2432 | |
| r | 2432 | |
| b | 2432 | |
| a | 2432 | |
| n | 2432 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2432 | |
| r | 2432 | |
| b | 2432 | |
| a | 2432 | |
| n | 2432 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 65 |
| Missing (%) | 2.5% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12425 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2485 | |
| (Missing) | 65 | 2.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2485 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2485 | |
| r | 2485 | |
| b | 2485 | |
| a | 2485 | |
| n | 2485 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9940 | |
| Uppercase Letter | 2485 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2485 | |
| b | 2485 | |
| a | 2485 | |
| n | 2485 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2485 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12425 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2485 | |
| r | 2485 | |
| b | 2485 | |
| a | 2485 | |
| n | 2485 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12425 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2485 | |
| r | 2485 | |
| b | 2485 | |
| a | 2485 | |
| n | 2485 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.0 KiB |
| Urban |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12750 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
Common Values
| Value | Count | Frequency (%) |
| Urban | 2550 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| urban | 2550 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2550 | |
| r | 2550 | |
| b | 2550 | |
| a | 2550 | |
| n | 2550 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10200 | |
| Uppercase Letter | 2550 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2550 | |
| b | 2550 | |
| a | 2550 | |
| n | 2550 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2550 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12750 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2550 | |
| r | 2550 | |
| b | 2550 | |
| a | 2550 | |
| n | 2550 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2550 | |
| r | 2550 | |
| b | 2550 | |
| a | 2550 | |
| n | 2550 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | LAT | LON | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | LABEL2013 | LABEL2014 | LABEL2015 | LABEL2016 | LABEL2017 | LABEL2018 | LABEL2019 | LABEL2020 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 107 | 17.3775 | 78.04750 | 0.46807 | 0.50665 | 0.46639 | 0.45427 | 0.47699 | 0.45679 | 0.42616 | 0.50689 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 1 | 108 | 17.3825 | 78.04750 | 0.43132 | 0.45377 | 0.43575 | 0.41307 | 0.43716 | 0.40492 | 0.36429 | 0.46852 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2 | 143 | 17.3775 | 78.05251 | 0.46602 | 0.50911 | 0.46874 | 0.45344 | 0.48692 | 0.45935 | 0.42363 | 0.50702 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 3 | 246 | 17.3325 | 78.06250 | 0.38557 | 0.41282 | 0.37643 | 0.38042 | 0.41350 | 0.36358 | 0.37119 | 0.42814 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 4 | 247 | 17.3375 | 78.06250 | 0.40016 | 0.40265 | 0.38272 | 0.38156 | 0.42316 | 0.35924 | 0.38613 | 0.43244 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 5 | 347 | 17.6225 | 78.06750 | 0.49796 | 0.51019 | 0.49874 | 0.43338 | 0.48666 | 0.48804 | 0.44542 | 0.48810 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 6 | 348 | 17.6275 | 78.06750 | 0.51661 | 0.51754 | 0.50118 | 0.44012 | 0.49441 | 0.49761 | 0.44885 | 0.47394 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 7 | 425 | 17.6075 | 78.07250 | 0.38281 | 0.39561 | 0.37005 | 0.33364 | 0.37499 | 0.37456 | 0.36513 | 0.38423 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 8 | 426 | 17.6125 | 78.07250 | 0.41392 | 0.43616 | 0.41272 | 0.36366 | 0.39584 | 0.38924 | 0.37742 | 0.40263 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 9 | 427 | 17.6175 | 78.07250 | 0.46341 | 0.48440 | 0.47281 | 0.42763 | 0.45675 | 0.44923 | 0.43628 | 0.46816 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
Last rows
| df_index | LAT | LON | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | LABEL2013 | LABEL2014 | LABEL2015 | LABEL2016 | LABEL2017 | LABEL2018 | LABEL2019 | LABEL2020 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2540 | 25379 | 17.5225 | 78.89250 | 0.33712 | 0.34544 | 0.32894 | 0.31398 | 0.34476 | 0.33625 | 0.32800 | 0.35803 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2541 | 25380 | 17.5275 | 78.89250 | 0.39917 | 0.41380 | 0.37952 | 0.38031 | 0.40378 | 0.39219 | 0.37231 | 0.43909 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2542 | 25381 | 17.5325 | 78.89250 | 0.40976 | 0.42836 | 0.39547 | 0.38901 | 0.42572 | 0.40216 | 0.38748 | 0.49498 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2543 | 25382 | 17.5375 | 78.89250 | 0.39758 | 0.41259 | 0.38598 | 0.36814 | 0.41094 | 0.39154 | 0.38673 | 0.48909 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2544 | 25399 | 17.2525 | 78.89750 | 0.33924 | 0.31975 | 0.29171 | 0.28282 | 0.31751 | 0.29269 | 0.29810 | 0.33698 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2545 | 25453 | 17.5225 | 78.89750 | 0.37148 | 0.37623 | 0.35456 | 0.33444 | 0.37231 | 0.35786 | 0.35958 | 0.38779 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2546 | 25455 | 17.5325 | 78.89750 | 0.40606 | 0.42729 | 0.38914 | 0.38707 | 0.41392 | 0.38866 | 0.39421 | 0.47712 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2547 | 25473 | 17.2525 | 78.90250 | 0.36136 | 0.33457 | 0.30854 | 0.30166 | 0.35136 | 0.31615 | 0.30267 | 0.35836 | None | None | None | None | None | None | None | Urban |
| 2548 | 25774 | 17.4625 | 78.92250 | 0.41424 | 0.40772 | 0.38819 | 0.36489 | 0.43117 | 0.42993 | 0.40344 | 0.48538 | Urban | Urban | Urban | Urban | Urban | Urban | Urban | Urban |
| 2549 | 25800 | 17.2325 | 78.92751 | 0.39387 | 0.37387 | 0.36644 | 0.39412 | 0.44541 | 0.41260 | 0.33360 | 0.40543 | None | None | None | None | None | None | None | Urban |